AITopics | constrained episodic reinforcement

Collaborating Authors

constrained episodic reinforcement

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Constrained episodic reinforcement learning in concave-convex and knapsack settings

Neural Information Processing SystemsDec-24-2025, 12:47:52 GMT

We propose an algorithm for tabular episodic reinforcement learning with constraints. We provide a modular analysis with strong theoretical guarantees for settings with concave rewards and convex constraints, and for settings with hard constraints (knapsacks). Most of the previous work in constrained reinforcement learning is limited to linear constraints, and the remaining work focuses on either the feasibility question or settings with a single episode. Our experiments demonstrate that the proposed algorithm significantly outperforms these approaches in existing constrained episodic environments.

concave-convex and knapsack, constrained episodic reinforcement, name change, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.59)

Add feedback

Review for NeurIPS paper: Constrained episodic reinforcement learning in concave-convex and knapsack settings

Neural Information Processing SystemsFeb-5-2025, 06:20:18 GMT

Weaknesses: My major concerns: 1. line 248 suggested linear programming could be used in ConPlanner, but instead the experiment tested on different unconstrained RL planners under Lagrangian heuristic. I think the papers should have compared results of different constrained problem solver. While theoretical proof was plenty, the paper didn't provide any empirical support, making this method less intuitive. Although the paper claimed they compared the proposed framework with other concave-convex approaches, the problems they experimented on didn't seem to be concave-convex. Grid world problem such as Mars rover applied in the paper has linear constraints instead of convex ones.

concave-convex and knapsack, constrained episodic reinforcement, neurips paper, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Review for NeurIPS paper: Constrained episodic reinforcement learning in concave-convex and knapsack settings

Neural Information Processing SystemsFeb-5-2025, 06:20:18 GMT

While it is true that constraints can typically be made part of the normal optimisation process in RL, by encapsulating them into the reward function, it can often be much easier to specify constraints directly, which is the setting this paper considers. The reviewers were positive about the motivation and execution of this paper, and were all in favour of accepting the paper. I would suggest already motivating this setting, at least somewhat, in the abstract, to help interesting readers find and appreciate this paper more easily.

concave-convex and knapsack, constrained episodic reinforcement, neurips paper

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Constrained episodic reinforcement learning in concave-convex and knapsack settings

Neural Information Processing SystemsOct-11-2024, 05:53:45 GMT

concave-convex and knapsack, constrained episodic reinforcement, episodic reinforcement, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.99)

Add feedback